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Abstract 

The conventionalistic aspects of physical world perception are reviewed 
with an emphasis on the constancy of the speed of light in relativity theory 
and the irreversibility of measurements in quantum mechanics. An appendix 
contains a complete proof of Alexandrov's theorem using mainly methods 
of affine geometry. 

1 Know thyself 

This inscription on the Oracle of Apollo at Delphi, Greece, dates from 6th cen- 
tury B.C., and it is still of tremendous importance today. For we do not and never 
will see the world "as is," but rather as we perceive it. And how we perceive 
the world is mediated by our senses which serve as interfaces to the world "out 
there" (if any); but not to a small extend also by what we project onto it. Con- 
ventions are projections which we have to adopt in order to be able to cope with 
the phenomena "streaming in" from the senses. Conventions are a necessary and 
indispensable part of operationalizableQ phenomenology and tool-building. There 

' In what follows we shall adopt Bridgman's concept of "operational" as one of quite simple- 
minded experimental testability, even in view of its difficulties which this author approached later 
on [15 
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is no perception and intervening without conventions. They lie at the very founda- 
tions of our world conceptions. Conventions serve as a sort of "scaffolding" from 
which we construct our scientific worldview. Yet, they are so simple and almost 
self-evident that they are hardly mentioned and go unreflected. 

To the author, this unreflectedness and unawareness of conventionality appears 
to be the biggest problem related to conventions, especially if they are mistakenly 
considered as physical "facts" which are empirically testable. This confusion 
between assumption and observational, operational fact seems to be one of the 
biggest impediments for progressive research programs, in particular if they sug- 
gest postulates which are based on conventions different from the existing ones. 

In what follows we shall mainly review and discuss conventions in the two 
dominating theories of the 20th century: quantum mechanics and relativity theory. 

2 Conventionality of the constancy of the character- 
istic speed 

Suppose two observers called Alice and Bob measure space and time in two coor- 
dinate frames. Operationally their activities amount to the following. They have 
constructed "identical" clocks and scales of "equal" length which they have com- 
pared in the distant past; when Bob lived together with Alice. Then they have 
separated. Alice has decided to depart from Bob and, since then, is moving with 
constant speed away from him. How do Bob's and Alice's scales and clocks com- 
pare now? Will they be identical, or will they dephase? 

These are some of the questions which "relativity" theory deals with. It de- 
rives its name from Poicare's 1904 "priciple of relativity" stating that p. 74] 
"the laws of physical phenomena must be the same for a stationary observer as 
for an observer carried along in a uniform translation; so that we have not and 
can not have any means of discerning whether or not we are carried along in 
such a motion." Formally, this amounts to the requirement of form invariance or 
covariance of the physical equations of motion. 

One of the seemingly mindboggling features of the theory of relativity is the 
fact that simultaneity and even the time order of two events needs no longer be 



2 



conserved. It may indeed happen that Alice perceives the first event before the 
second, while Bob perceives both events as happening at the same time; or even 
the second event ahead of the first. Simulaneity can only be defined "relativ" to a 
particular reference frame. If true there, it is false in any different frame. 

The first part of Einstein's seminal paper is dedicated to a detailed study 
of the intrinsically operational procedures and methods which are necessary to 
conceptualize the comparison of Alice's and Bob's reference frames. This part 
contains certain "reasonable" conventions for defining clocks, scales, velocities 
and in particular simultaneity, without which no such comparison could ever be 
achieved. These conventions appear to be rather evident and natural, almost triv- 
ial, and yield a convenient formalization of space-time transformations, but they 
are nevertheless arbitrary. The simultaneity issue has been much debated in the 



contemporary discussions on conventionality [H, ^ |lOp. 



There is another element of conventionality present in relativity theory which 
has gotten less attention [[TTl]. It is the assumption of the constancy of the speed 
of light. Indeed, the International System of units assumes light to be constant. 
It was decided in 1983 by the General Conference on Weights and Measures that 
the accepted value for the speed of light would be exactly 299,792,458 meters 
per second. The meter is now thus defined as the distance traveled by light in a 
vacuum in 1/299,792,458 second, independent of the inertial system. 

Despite the obvious conventionality of the constancy of the speed of light, 
many introductions to relativity theory present this proposition not as a conven- 
tion but rather as an important empirical finding. Indeed, it is historically correct 
to claim that experiments like the ones of Fizeau, Hoek and Michelson-Morley, 
which produced a null result by attempting to measuring the earth's motion against 
some kind of "ether," preceded Einstein's special theory of relativity. 

But this may be misleading. First of all, Einstein's major reason for intro- 
ducing the Lorentz transformation seems to have been the elimination of asym- 
metries which appeared in the electromagnetic formalism of the time but are not 
inherent in the phenomena, thereby unifying electromagnetism. Secondly, not too 
much consideration has been given to the possibility that experiments like the one 
of Michelson and Morley may be a kind of "self-fulfilling prophesy," a circular, 
closed tautologic exercise. If the very instruments which should indicate a change 
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in the velocity of light are themselves dilated, then any dilation effect will be ef- 
fectively nullified. This possibility has already been imagined in the 18th century 
by Boskovich [ |12[ J and was later put forward by FitzGerald | J13| J (see also John 
Bell [0, |15]]), Lorentz, Poincare and others in the context of the ether theory 

But what is the point in arguing that the constancy of the speed of light is a 
convention rather than an empirical finding? Is this not a vain question; devoid of 
any operational testability? 

The answer to this concern is twofold. First, a misinterpretation might give 
rise to a doctrinaire and improper preconception of relativity theory by limiting 
the scope of its applicability. Indeed, as it turns out, for reasons mentioned below 
[[TT|], the special theory of relativity is much more generally applicable as is nowa- 
days appreciated. It applies also to situations in which the velocity of light needs 
not necessarily be the highest possible limit velocity for signaling and travel. Sec- 
ondly, it is not totally unreasonable to ask the following question. What if one 
adopts a different convention by assuming a different velocity than that of light to 
be the basis for frame generation? Such a velocity may be anything, sub- but also 
superluminal. What will be the changes to Alice's and Bob's frames, and how do 
these new coordinates relate to the usual "luminal" frames? 

These issues have been discussed by the author [ pT] ] on the basis of a geo- 
metrical theorem by Alexandrov [[T^, |T7|, [rS|, |T9p and Borchers and Hegerfeldt 
[ ]20| ] reviewed in [ plj |22| ] (see also previous articles [|2^, ^). Alexandrov's the- 
orem requires the convention that some speed is the same in Bob's and Alices's 
frames. Furthermore, if two space-time points are different in Alice's frame, then 
these points must also be mapped into different points in Bob's frame and vice 
versa; i.e., the mapping must be one-to-one, a bijection. It can be proven that 
under these conditions, the mapping relating Alice's and Bob's frames must be 
an affine Lorentz transformation, with some fundamental speed playing the role 
of light in the usual Lorentz transformations of relativity theory. The nontrivial 
geometric part of a proof uses the theorem of affine geometry, which results in the 
linearity of the transformation equations. No Poicare's 1904 "priciple of relativ- 
ity," no relativistic form invariance or covariance is needed despite the postulate 

^ In stressing the conventionality aspect of tliese effects, the author would like to state that he 
does not want to promote any ether-type theory, nor is he against any such attempts. 
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or convention of equality of a single speed in all reference frames. The deriva- 
tion uses geometry, not physics. The Appendix contains a detailed derivation of 
Alexandrov's theorem which should be conprehendible for a larger audience. 

To repeat the gist: it is suggested that the signalling velocity occuring in the 
Lorentz transformation is purely conventional. This effectively turns the interpre- 
tation of relativity theory upside down and splits it into two parts, one geometric 
and one physical, as will be discussed next. 

So where is the physics gone? The claim of conventionality arouses suspi- 
cions. The proper space-time transformations cannot be purely conventional or 
even a matter of epistemology! After all, the Michelson-Morley experiment and 
most of its various pre- and successors actual yielded null results, which are valid 
physical observations as can be. The experimenters never explicitly acknowledged 
the conventionality of the constancy of the speed of light and approved their instru- 
ments according to these specifications. Just on the contrary, they first assumed 
to measure the unequality and anisotropy of the speed of light. And what if Alice 
and Bob assume, say, the constancy of the speed of sound instead of light? Would 
the mere assumption change the reading of the instruments in a Michelson-Morley 
experiment using sound instead of light? This seems to be against all intuitions 
and interpretations and the huge accumulated body of evidence. 

The answer to these issues can be sketched as follows. First of all, the physics 
is in the form invariance of the electromagnetic equations under a particular type 
of Lorentz transformations: those which contain the speed of electromagnetic 
signals; i.e., light, as the invariant speed. Thus, merely the convention of the con- 
stancy of the speed of light in all reference frames yields the desirable relativistic 
covariance of the theory of electromagnetism. This is a preference which cannot 
be motivated by geometry or epistemology; it is purely physical. 

However, any such Lorentz transformation will result in a non-invariance of 
the theory of sound or any other phenomena which are not directly dominated 
by electromagnetism. There, an asymmetry will appear, singling out a particular 
frame of reference from all the other ones. 

Thus, one may speculate that the most efficient "symmetric" representation 
of the physical laws is by transformations which assume the convention of the 
invariant signalling velocity which directly reflects the phenomena involved. For 



5 



electromagnetic phenomena it is the speed of electromagnetic waves; i.e., light. 
For sound phenomena it is the speed of sound. For gravity it is the speed of 
gravitational waves. Thus the conventionality of the relativity theory not only 
relativizes simultaneity but must also reflect the particular class of phenomena; 
in particular their transmission speed(s). In that way, a general form invariance 
or covariance is achieved, satisfying Poicare's 1904 "priciple of relativity" men- 
tioned above, which is not only limited to electromagnetism but is valid also for a 
wider class of systems. 

Secondly, it is not unreasonable to assume that in the particular context of 
the Michelson-Morley and similar experiments, all relevant physical system pa- 
rameters and instruments are governed by electromagnetic phenomena and not 
by sound, gravity or something else. Thus, although not explicitly intended, the 
experiments are implicitly implementing the conventionality of the constancy of 
light. Of course, the experimenter could decide to counteract the most natural way 
to gauge the instruments and measure space and time differently than as suggested 
by the intruments. For instance, one may adopt scales to measure space which are 
anisotropic and velocity dependent. But this would be a highly unreasonable, 
unconvenient and complicating thing to do. 

So, from a system theoretic standpoint, the proper convention suggests itself 
by the dominating type of interaction, and only in this way corresponds to a phys- 
ical proposition. The result is a generalized principle of relativity. 

3 Conventionality of quantum measurements 

In what follows the idea is put forward and reviewed that measurements in quan- 
tum mechanics are (at least in principle) purely conventional. More precisely, 
it is purely conventional and subjective what exactly an "observer" calls "mea- 
surement." There is no distinction between "measurement" and ordinary (uni- 
tary) quantum evolution other that the particular interpretation some (conscious?) 
agent ascribes to a particular process [p5|]. Indeed, the mere distinction between 
an "observer" and the "object" measured is purely conventional. Stated pointedly, 
measurement is a subjective illusion. We shall call this the "no measurement" 
interpretation of quantum mechanics. 
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The idea that measurements, when compared to other processes (involving en- 
tanglement), are nothing special, seems to be quite widespread among the quan- 
tum physics community; but it is seldomly spelled out publicly Indeed, 
the possibility to "undo" a quantum measurement has been experimentally doc- 
umented [^, while it is widely acknowledged that practical bounds to maintain 
quantum coherence pose an effective upper limit on the possibility to reconstruct 
a quantum state. We shall not be concerned with this upper bounds, which does 
not seem to reflect some deep physical truth but rather depends on technology, 
financial commitments and cleverness on the experimenter's side. 

Rather, we shall discuss the differences between the two types of time evolu- 
tion which are usually postulated in quantum mechanics: (i) unitary, one-to-one, 
isometric time evolution inbetween two measurements and (ii) many-to-one state 
reduction at the measurement. 

Inbetween two measurements, the quantum state undergoes a deterministic, 
unitary time evolution, which is reversible and one-to-one. This amounts to ar- 
bitrary generalized isometrics — distance-preserving maps — in complex Hilbert 
space. A discrete analogue of this situation is the permutation of states. An "initial 
message" is constantly being re-encoded. As such an evolution is reversible, there 
is no principle reason why any such evolution cannot be undone. (There may be 
insurmountable practical obstacles, though.) 

Any irreversible measurement is formally accompanied by a state reduction or 
"wave function collapse" which is many-to-one. Indeed, this many-to-oneness is 
the formal mathematical expression of irreversibility. 

What is a measurement? Besides all else, it is associated with a some sort of 
"information" transfer through a fictious boundary between some "measurement 
apparatus" and the "object." In the following we shall call this fictious boundary 
the "interface." The interface has no absolute physical relevance but is purely 
conventional. It serves as a scaffolding to mediate and model the exchange. In 
principle, it can be everywhere. It is symmetric: the role of the observer and the 
observed system is interchangeable and a distinction is again purely conventional. 

In more practical terms, it is mostly rather obvious what is the observer's side. 
It is usually inhabited by a conscious experimenter and his measurement device. 
It should be also in most cases quite reasonable to define the interface as the 
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location where some agent serving as the experimenter looses control of one-to- 
onenness. This is the point where "the quantum turns classical." But from the 
previous discussion it should already be quite clear that any irreversibility in no 
way reflects a general physical principle but rather the experimenter's ability to 
reconstruct previous states. Another "observer" equipped with another technology 
(or just more money) may draw very different interface lines. 

Let me add one particular scenario for quantum information. Assume as an 
axiom that a physical system always consists of a natural number of n quanta 
which are in a single pure state among q others. Any single such particle is thus 
the carrier of exactly one ^-it, henceforth called "quit." (In the spin-one half case, 
this reduces to the bit.) That is, encoded in such a quantum system are always n 
quits of information. The quit is an irreducible amount of classical and quantum 
information. The quits need not be located at any single particle (i.e., one quit per 
particle), but they may be spread over the n particles [|2^]. In this case one calls the 
state of the particles "entangled." According to Schrodinger's own interpretation 
[]30|], the quantum wave function (or quantum state) is a "catalogue of expectation 
values" about this state; and in particular about the quits. Since an experimenter's 
knowledge about a quantum system may be very limited, the experimenter might 
not have operational access to the "true" pure state out there. (In particular, it 
need not be clear which questions have to be ask to sort out the valid pure state 
from other ones.) This ignorance on the experimenter's side is characterized by 
a nonpure state. Thus one should differentiate between the "true" quantum state 
out there and the experimenter's "poor man's version" of it. Both type of states 
undergo a unitary time evolution, but their ontological status is different. 

Why has the no-measurement interpretation of quantum mechanics been not 
wider accepted and has attracted so little attention so far? One can only speculate 
about the reasons. 

For one thing, the interpretation seems to have no operational, testable conse- 
quences. Indeed, hardly any interpretation does. So, what is any kind of interpre- 
tation of some formalism good for if it cannot be operationalized? 

Think of the Everett interpretation of quantum mechnics, which is neverthe- 
less highly appreciated among some circles, mainly in the quantum computation 
community. It has to offer no operationalizable consequences, just mindboggling 
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scenarios. 

Or consider Bohr's "Copenhagen" interpretation, whatever that means to its 
successors or to Bohr himself. It is the canonical interpretation of quantum me- 
chanics, a formalism co-created by people, most notably Einstein, Schrodiner and 
De Broglie, who totally disagreed with that interpretation. This does not seem to 
be the case for Heisenberg and von Neumann. The latter genius even attempted to 
state an inapplicable theorem directed against hidden parameters to support some 
of Bohr's tendencies. Nowadays, many eminent researchers in the foundations 
of quantum mechanics still stick with Bohm's interpretation or whatever sense 
they have made out of it. But does Bohr's "Copenhagen" interpretation have any 
operational consequences? 

With the advent of quantum information theory, the notion of information 
seems to be the main interpretational concept. Consequently, information inter- 
pretations of quantum mechanics begin to be widespread. Yet, despite the heavy 
use of the term "information," the community does not seem to have settled upon 
an unambiguous meaning of the term "information." And also in this case, the 
interpretations do not seem to have operational consequences. 

Many recent developments in quantum information theory are consistent with 
the no-measurement interpretation. Unitarity and the associated one-to-onenness 
even for one quantum events seems to be the guiding principle. Take, for example, 
the no-cloning theorem, quantum teleportation, entanglement swapping, purifica- 
tion and so on [ |3T1 , |32| ]. Actually, the no-measurement interpretation seems to 
promote the search for new phenomena in this regime, and might thus contribute 
to a progressive research program. 

Indeed, it is the author's belief that being helpful in developing novel theories 
and testing phenomena is all one can ever hope for a good interpretation. Any 
"understanding" of or "giving meaning" to the formalism is desireable only to 
the effect that it yields new predictions, phenomena and technologies. And in this 
sense, the no-measurement interpretation claiming the conventionality of quantum 
measaurements should be perceived. It too cannot offer direct operationalizable 
consequences, yet may facilitate thoughts in new, prosperous directions. 
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4 Summary 



We have reviewed conventions in two of the dominating theories of contemporary 
physics, the theory of relativity and quantum mechanics. In relativity theory we 
suggest to accept the constancy of one particular speed as a convention. Lorentz- 
type transformation laws can then be geometrically derived under mild side as- 
sumptions. In order for a generalied principle of relativity and thus generalized 
form invariance to hold, the particular signalling type entering the transformations 
should correspond to the dominating type of physical interactions. 

The no-measurement interpretation of quantum mechanics suggests that there 
is no such thing as an irreversible measurement. In fact, there is no measurement 
at all, never. This kind of irreversibility associated with the measurement process 
is just an idealistic, subjective construction on the experimenter's side to express 
the for-all-practical-purposes impossibility to undo a measurement. 

Appendix. Proof of Alexandrov's theorem 

Alexandrov's theorem states that, for M", n>3 with the metric diag(-f, -|-, -|-, ■ ■ ■ , -|-, — ) 
and a one-to-one map r ^ r' preserving light cones (i.e., zero distance) such that 
forallr,5GM", 

{r — s,r — s) =0 <^=^ (/ — s , r — /) = 0, 

r / is essentially a Lorentz transformation; i.e., it has the form r^r' = aLr + a 
for some nonzero a G M. a E M", and a linear one-to-one map L : M" i-^ 
satisfying (Lr.Ls) = {r,s) for all r and s in W\ 

In what follows we shall review a complete proof of Alexandrov's theorem 
very similar to the one sketched by Lester P^. The proof consists of three stages: 

(I) a proof that, given M", n > 3 with the metric diag(+, +,•••,+,—) and a one- 
to-one map preserving light cones (i.e., zero distance), all lines are mapped 
onto lines; 

(n) a proof of the fundamental theorem ofqffine geometry stating that a one-to- 
one map from R", n > 2 onto itself which maps all lines onto lines must be 
affine; i.e., must be a linear map and a translation; 
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(Ill) a proof that, given W\n>2 with the metric diag(+, •■■,+,—) and a linear 
one-to-one map preserving a single light cone (i.e., zero distance) must be 
essentially a Lorentz transformation (up to a translation and a dilatation); 
i.e., it has the form r ^ aLr + a for some nonzero a G M. a G M", and a 
linear one-to-one map L:W t-^ satisfying {Lr,Ls) = {r,s) for all r and s 
inM". 

In what follows, a constant translation is taken account of by addition of a 
vector a E W\ The remaining transformation preserves the origin; i.e., t-^ 0'. 
We shall often refer to this remaining transformation (after the constant parallel 
shift moving the map of the origin back into the origin) simply as (homogeneous) 
transformation. (Note that if f : r t-^ aLr + a, then the homogeneous part os ob- 
tained by subtracting a = /(O).) This constant shift a has to be added to the final 
mapping. 

The geometric proof of (I) proceeds in five steps, covering the mapping of 
(i) lightlike (null) lines onto lightlike lines; (ii) lightlike (null) planes onto null 
planes; (iii) spacelike lines onto spacelike lines; (iv) timelike planes onto planes; 
and finally (v) timelike lines onto lines. 

In what follows, the configurations are demonstrated for M? with the metric 
(r, s) = risi + r2S2 — ( 1/ c^)r3S3. For arbitrary dimensions we refer to pi]]. In this 
section, the velocity of light c will be set to unity; i.e., c = 1. The terms "null" 
and "lightlike" will be used synonymously. 

To show (i) let us first define a null cone with vertex a by 

C{a) = {reM?\{r-a,r-a)=0]. 

By assumption, light cones are preserved, i.e., C{a) ^ C{a'). 

As illustrated in Fig. |I](a), any null (lightlike) line is the intersection of two 
tangent null cones. Since null cones are preserved, so are null (lightlike) lines. 
Thus, null lines are mapped into null lines. The same is true for the inverse map. 
Hence, null lines are mapped onto null lines. (The same is true for the other proof 
steps as well but will not be mentioned explicitly.) 

To show (ii), notice that, as illustrated in Fig. [T](b), a null cone with vertex on 
some null plane is tangent to that plane along a null line. Points of are on the 
null plane if and only if they either lie on this null line or on no null cone with 
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Figure 1 : Illustrations of the proof that (a) lightlike (null) lines into lightlike lines; 
(b) lightlike (null) planes into null planes; (c) spacelike lines into spacelike lines; 
(d) timelike planes map into planes. 
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vertex on this line. The latter sentence could be understood as follows. Imagine 
any point of M? outside of the null plane (either "below" or "above"). Any such 
point is element of some null cone with vertex on the null line mentioned. On the 
contrary, any point on the null plane cannot be reached by such null cones (except 
the ones located on the null line mentioned), but by other null cones whose vertex 
is not on that null line. Null lines and cones are preserved; thus null planes are 
preseved as well. 

To show (iii), notice that, as illustrated in Fig. |l](c), any spacelike line is the 
intersection of two null planes. Since null planes are preserved, spacelike lines 
are preserved. 

To show (iv), notice that, as illustrated in Fig. [T](d), the points in a timelike 
plane are covered by infinitely many intersecting null and spacelike lines in that 
plane. By fixing, for instance, a triangle formed by the vertices a, b, c of three such 
lines (e.g., two null and one spacelike line) "spans" the timelight plane. Because 
of the one-to-oneness of the mapping, the image of the triangle with the vertices 
a', y , c' "spans" the transformed plane (different points are mapped onto different 
points). Therefore, the three lines forming the transformed triangle must be copla- 
nar. In general, the images of all lines lying in the original timelight plane must 
be coplanar. Thus, timelike planes map into planes. 

To show (v), notice that any timelight line is the intersection of two timelight 
planes. Since timelike planes are mapped onto planes, they intersect into a line. 
Thus, any timelike line is mapped into a line. 

In summary, all three types of lines — lightlike (null), spacelike and timelike 
lines — are mapped onto lines. (Recall that the same arguments apply for the in- 
verse transformation as well.) 

The geometric proof of (II), in particular the linearity of the transformation 
proceeds from the preservation of lines essentially by utilizing the preservation of 
parallelism among lines. As will be demonstrated below, the preservation of par- 
allelism implies that the transformation is additive. The associated transformation 
of the field M is an automorphism. It then only remains to be proven that the only 
automorphism of M is the identity function. 

Let us first introduce some notation. For a much more comprehensive ap- 
proach the reader is refered to the literature (e.g., the book by Gruenberg & Weir 
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p3|]). Let a be a fixed "translation" vector of R" and M be a linear subspace of 
M". [Recall that a subset 5 C M" is called a (linear) subspace if 5 is a vector space 
in its own right with respect to the same vector addition and scalar multiplication 
than W.] Then a + M denotes the set of all vectors a + M = {a + m\ m E M}. 
It is called translated subspace or coset or affine subspace of M". The dimen- 
sion of a translated subspace a + M is, the dimension of the linear subspace M; 
i.e., dim(a + M) = dim(M). Translated subspaces of dimensions 0, 1,2 are called 
points, lines and planes, respectively. Let the join Si o 52 of two translated sub- 
spaces 5i,52 be the intersection of all translated subspaces in M" which contain 
both 5i and 52. (The join is again a translated subspace.) Furthermore, if 5 C M" 
is any set of vectors in M", we denote by the (linear) span span(5) the intersection 
of all the subspaces of M" which contain 5. 

We shall call an automorphism a one-to-one mapping of M" onto itself pre- 
serving all translated subspaces. The fundamental theorem of affine geometry 
(e.g., ref. Theorem 5]) states that, for R", n>2, any automorphism induces 
a linear transformation L and a translation vector a such that r ^ r' = aLr + a. 

In what follows, a proof of the fundamental theorem of affine geometry will 
be given for the case of the vector space R", n > 2 with field R. First, a proof 
will be given that any such automorphism of R" implies an automorphism on the 
field of reals R (a definition will be given below). By invoking the preservation of 
parallelism one obtains both the uniqueness of the associated mapping of the field 
R onto itself and furthermore the additivity of the transformation as a whole. 

Note that the automorphism preserves parallellism. This can be seen by "fix- 
ing" appropriate four points a,b,c,d on two lines which are originally parallel, 
drawing two nonparallel lines through them which meet in another point e. Since 
by assumption all lines are preserved, so are their meeting points a',b' ,c',d' . Fur- 
thermore, because of bijectivity, two parallel lines have no point in common. 
Thus, the two lines which are originally parallel are mapped onto colpanar lines 
which are disjoint; i.e., they are again parallel. Hence, parallelism is conserved. A 
concrete configuration illustrating this geometrical argument is drawn in Fig. ^ 

Consider an arbitrary nonzero vector a E R". According to the assumptions, 
any line o a = span(a) is transformed into a line 0' oa' = span(a'), thereby in- 
ducing a one-to-one mapping of all points of span(a) onto the points of span(a'). 
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Figure 2: Geometrical proof of the preservation of parallelism due to the preser- 
vation of lines. 

That is, the transformation defines a one-to-one mapping 

^■.x^x' (1) 

of the field of real numbers onto itself by the definition 

{xay = x'a'. (2) 

It immediately follows that ^ : i-^ 0' as well as ^ : 1 i-^ 1'. It will be shown that ^ 
is an automorphism; i.e., a one-to-one mapping of M onto itself with the properties 
that !;{x+y) = + i;{y), as well as i;{xy) = ^^^(j). 

First it is shown that C, does not depend on the particular choice of a G M". 
(i) Case 1: Consider two linearly independent vectors a,b of M", (xa)' = x! d and 
[xh)' =x"b' , x' ^x" . Since = 0' = 0", one can assume that jc 7^ 0. The join ;cao;cZ7 
is the intersection of all the subspaces of M" containing both xa and xb. Since xa 
and xb are vectors, this is just the subspace spanned by the line joining them. 
xaoxb is parallel io aob. (Rescaling does not affect parallelism; cf. Fig. |3|.) The 
transformation preserves parallelism, and therefore a' o b' must also be parallel to 
x'aox'b' and x'aox"b' , the lines connecting x'a with x'b' and x'a with x"b' . This 
can only be satisfied for x' = x" . Hence, ^ is independent of the argument and 
only depends on the transformation. 

(ii) Case 2: Consider two linearly dependent vectors a,b of M". In this case, 
choose a third vector c which does not lie in the linear subspace span(a) spanned 
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Figure 3: Rescaling does not effect parallelism. 

by a and b. Then, by the argument used in case 1, ^ is the same for a, c and b, c; 
thus C, is also the same for a, b. Hence, to sum up the finding in the two cases, C, is 
independent of the argument vector and only depends on the transformation. 

We shall pursue the proof that, given the preservation of lines, the associated 
mapping is additive (up to translations). A geometric interpretation of this proof 
is drawn in Fig. ^. (i) Case 1 : If a and b are linearly independent nonzero vectors 
(the zero vector case is trivial) of M", then the parallelogram a,0,b,a+bis mapped 
into the parallelogram a' ,0' ,b' ,a' + b' and 

{a + b)' = a' + b'. (3) 

This is also true if a ox b is the zero vector. 

(ii) Case 2: If a and b are linearly dependent and nonzero, choose a third 
vector c ^ span(a) (so that c is linearly independent of a and b), and apply the 
above considerations for the pairs a-\-b & c rendering a + b + c ^ (a + b)' + c', a 
&b + c rendering a + b + ci-^a' + {b + c)', b & c rendering b + c ^-^ b' + c', such 
that {a + b)' + c' = a' -\-b' -\- c' , which is satisfied only if again {a + b)' = a' + b' . 

Two further properties assuring that ^ is an automorphism can be deduced 
from the uniqueness of Eq. (0) and Eq. (0) and the usual axioms of linear vector 
spaces, (i) Automorphism property 1: Let a' ^ 0, then for all x,y e M, 

(jc+y)V = {{x + y)a\ = {xa + yd)' = {xd)' + {yd)' =y!cl +y' d = (/ + /)«' (4) 

and thus 

{x + y)'=^+y'. (5) 
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Figure 4: Geometrical proof of the preservation of any parallelogram and of 
additivity due to the conservation of parallelism. 

(ii) Automorphism property 2: By the assumption of vector spaces, {xy)a — x{ya) 
for all jc,y e K and a e R". Therefore, 

(xy)'a' = x'{yay = x (y'a) = {x'y')a' (6) 

and thus 

(xy)' = xy. (7) 

In order to complete the proof of linearity, it will be shown that the only au- 
tomorphism of the field R into itself is the identity function id : x i— > x. This can 
be demonstrated by realizing that the algebraic properties of neutral elements 0, 1 
with regard to addition and multiplication have to be preserved; i.e., i— >^ and 
1 I— i> 1 . Furthermore, since 1 has to be preserved and any natural number n G N 
is the sum of n I's, N has only a single automorphism-the identity function id. 
A very similar argument holds for Z. Since any element of the positive rationals 
can be represented by the quotient n/m with n,m G N, again Q has only a single 
automorphism-the identity function id. In order to be able to obtain the same re- 
sult for R, one has to make sure the Dedekind construction of the reals works; in 
particular the preservation of Dedekind sections. This requires the preservation 
of the order relation "<" in R, which is equivalent to the preservation of positiv- 
ity, since x <y can always be rewritten into <y — x. Notice that every positive 
< X G R can be written as x = y^, j G R 3^ 7^ 0. Since x = is mapped onto 
y = = (y)2 with y 7^ (recall that ^ 0), / > 0. This allows the Dedekind 
construction of the reals using the rationals, which in turn yields the desired fact 
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that M has only a single automorphism-the identity function id. (This is not true 
for example for C, since for example x + iy ^-^ x — iy is an automorphism but not 
the identity.) 

A way to get rid of the factor a is by considering the tangent hyperboloid 
x^ +y'^ — = 1 of the null cone x^ +y'^ — = 0, translating it once and then 
back to the original figure. The requirement that this should result in the same 



hyperboloid fixes a [ |34| ] 



We shall now concentrate on a proof of (III). Let us first note that, in the case 
of a linear map, the preservation of a single light cone is a sufficient condition for 
the preservation of all of them. For, given the transformation x ^ aLx + a, any 
shift of the null cone C (p) with vertex p by a vector s = q — p results in a null 
cone C{q) = C (p) + s with vertex q = p + s. The latter null cone C (q) is mapped 
onto the null cone 

C (q) ^ aLC (q) + a = aL{c{p)+s)+a 
= aLc (p) + aLs + a 
= {aLc (p) +a)+ aLs 
= c{p')+aLs, 

which again is a null cone. 

Recall that in Einstein's original work par 3], linearity was never derived 
but was assumed for physical reasons. "In the first place it is clear that the equa- 
tions must be linear on account of the properties of homogeneity which we at- 
tribute to space and time." [["Zundchst ist klar, dafi die Gleichungen linear sein 
miissen wegen der Homogenitdtseigenschaften, welche wir Raum und Zeit bei- 
legen."]] In what follows we shall closely follow Einstein's original argument 
rendering L to be the Lorentz transformations. 

Take the standard four dimensional space-time case M^, and consider, for the 
sake of simplicity, the quasi-twodimensional case (one space and the time coordi- 
nate) of the constant motion along the jc-axis of K with velocity v of a coordinate 
frame K' with the components {x' ,y' ,z! ^t') against another coordinate frame "at 
rest" K with the components {x.y.z^t). (Otherwise, K can be rotated such that the 
direction of motion is along the x axis.) Again, c stands for the velocity of light. 
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M' x' 



XQ = Ajc 



X 



(a) 



(b) 



Figure 5: Generation of radar coordinates by a light clock following Einstein's 
procedures and conventions (a) from within the system K'; (b) the same procedure 
seen from the system K. 

Now define a particular series (in time) of points x = x — vt. Notice that the 
"worldlines" {x = , 0, 0, t) just mark the parametrization by the time parameter t 
of all points at rest with respect to the moving frame K' . That is, any such point 
has constant x,y,z throughout all times t. It is sometimes convenient (cf. below) 
to write the parameters of events in terms of {x.y.z^t) instead of (;c,y,z,f). 

Let us construct "radar coordinates" of K' by utilizing a light clock starting at 
some arbitrary point x = at traveling some distance M to a mirror, where it 
arrives and is instantly reflected at ^'-time t[ towards the original source mirror 
and arrives there at ^'-time ?2 [cf. Fig. ||(a)]. If one adopts the usual conventions 
for synchronization, t[ is just the arithmetic mean of the two times and i-C-, 



In order to find the transformation mapping K onto K' , rewrite the trans- 
formed coordinates as functions of the original system; e.g., t' = t\x,y,z,t). In 




(8) 
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this parametrization, the coordinates are given by 



f'(0,0,0,0, 
/ ( M,0,0,? + 



Ax 



(-2 _ y2 



, , Ax Ax 

0,0,0,?+^ ^ + 



(9) 
(10) 

(11) 



where 



h 



Ax 



9 9 : 



Ax Ax 
t + -. T + 



results from the following consideration. The ^-time A?i =t\—tQ it takes for light 
to arrive at the first mirror is given by the total distance it takes for light to travel 
to it, divided by the velocity of light. Since the mirror travels with velocity v, 



Ari 



Ax + vA?i Ax 



c — V 



(12) 



A similar argument yields 



A?i =t2-t\ 



Ax — vAfi Ax 



C + V 



Inserting Eqs. (g)-(|ni) into (g) yields 
M \ 1 



t' Ax,0,0,? + 



(2 _ y2 



/, ^ / / Ax Ax 
?'(0,0,0,0+? 0,0,0,?+^ ^ + 



(-2 _ y2 c2 _j_ yl 



( 



3) 



Ax can be arbitrarily small. A partial derivation of (|T3]) by ^ in the limit of 
infinitesimal Ax yields 



1 



^ 1 _ 1 



(14) 
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and thus 

dt' 1 dt' ^ 

Likewise, (dt'/dy) = (dt'/dz) = 0. As a result of this and Eq. (p3|), t' must be a 
linear function of t and x of the form 

t'{x,y,zj) = a{v)(^t---^^x^ . (16) 

a(v) is a yet arbitrary scale factor depending only on v. Note that, without loss 
of generality, the origins of K and K' has been chosen such that t = t' = 0. By 
substituting the explicit parameters for x = x — vt one obtains 

t'{x,y,z,t)=a{v)—^(^t-^xy (17) 

1 ^ 



The transformation rules of the / parameter can be obtained by considering 
the propagation of a light ray in K' which starts at the origin of K and K' (same 
origins) and moves along the x- and /-axes. The convention of the constancy of 
the speed of light requires 

X = ct' = a(v)c — 2—^^ ■ (18) 

Now recall that, in terms of the ^-parameters, this propagation of this light ray is 
given by Eq. ([T2|); i.e., hy t = x/{c — v) (the differences A can be omitted because 
of the ray starting at the coordinate origins). By substituting t in ( [T8| ) one obtains 

1 _ 1 

X = a(v)^= = a(v) = a(v) ^{^-vt). (19) 

^ ' c^-v^ ' ' 1 _ 4 ' ' 1 _ 4 ' 

Let us now turn to the transformation of coordinates y,z perpendicular to 
the direction of motion x. Consider a light ray propagating along the y'-axis, 
and hence x = 0. Inside the system K, the j-component of the light propaga- 
tion follows from the Pythagorean theorem, which is illustrated in Fig. |^; i.e., 
Vy = a/c^ — v^. Hence, 



/ = ct' = a(v)c — 2~~~2-^ 



(20) 
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Figure 6: Velocity Vy of a light ray propagating along the positive y' axis of a 
system traveling with velocity v along the x- and /-axes. 



for ;c = and t —y/vy = y/Vc'^ — v'^ 



y = oc(v) 



■ y- 



(21) 



The same consideration applies to the transformation of the z- and z'-axes. Sum- 

Lx given by 



ming up, we obtain a transformation of the coordinates x' 



( 



L(v) = a(v) 







1-^ 







1- 



72- 



1 







1 












1- 



3 



(22) 



We now fix the factor a(v) by the conventional requirement that a back- 
transformation should recover the original coordinates. For this purpose we in- 
vent a third coordinate frame K" which propagates with the reverse (relative to 

velocity — v (measured in K) along the x-, and its /'-axes. The successive 



application of the transformation ( |22| ) with L(v) and L(— v) should bring back the 
coordinates to their original form; i.e., 



L(v)L(-v) = I4, 



(23) 



where I4 = diag(l, 1,1,1) stands for the four-dimensional unit matrix. After eval- 
uating the matrix product and comparing the coefficients, one obtains 

a(v)a(-v) = 1-^. (24) 
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That a(v) = Oi{ \v\) only depends on the absolute value of the velocity can be seen 
by symmetry and isotropy arguments. For the length /' of a rod {p' G | jc' = 
0, <y' z' = 0} which is at rest along the /-axis with respect to the system 
K' traveling along the x-axis should not depend on the direction of motion; i.e., 
should only depend on the absolute magnitude of the velocity. If this is granted, 
one obtains 



a(v) = 

and finally the transformation laws jc i— > 



1 



Lx with 



(25) 



L(v) = 












1 







1 



\ 







(26) 



up to constant translations a e R . As can be easily checked, L preserves the 
distance of any two points; i.e., {Lr,Ls) — (r, s) for all r and s in R^. 

It would be nice to have a more general result using a more general metric 
and/or relaxation of bijectivity. 



Acknowledgments 

I have discussed the ideas exposed here with many researchers in numerous dis- 
cussions in Viennese coffee-houses and elsewhere. I am particularly indebted to 
Hans Havlicek. Almost needless to say, I take the full responsibility for all errors 
or misconceptions (if any). I would like to make it quite clear that I have not 
attempted to "disprove" relativity theory in any way. Just on the contrary, these 
considerations attempt to extend the domain of validity of relativity theory even 
further rather than discredit it. The extensions deal with universes which are dom- 
inated by interactions which are mediated through velocities different (and maybe 
also higher than) the velocity of light. 



23 



References 



[1] p. W. Bridgman. The Logic of Modern Physics. New York, 1927. 

[2] P. W. Bridgman. A physicists second reaction to Mengenlehre. Scripta 
Mathematica, 2:101-117, 224-234, 1934. Cf. R. Landauer [|35|]. 

[3] P. W. Bridgman. The Nature of Physical Theory . Princeton, 1936. 

[4] P. W. Bridgman. Reflections of a Physicist. Philosophical Library, New York, 
1950. 

[5] P. W. Bridgman. The Nature of Some of Our Physical Concepts. Philosoph- 
ical Library, New York, 1952. 

[6] Arthur I. Miller. Albert Einstein's special theory of relativity. Springer, New 
York, 1998. 

[7] Albert Einstein. Zur Elektrodynamik bewegter Korper. Annalen der Physik, 
17:891-921, 1905. English translation in the Appendix of [§]. 

[8] D. Malament. Causal theories of time and the conventionality of simultani- 
ety. Nous, 11:293-300, 1977. 

[9] Michael Redhead. The conventionality of simultaneity. In J. Earman, A. Ja- 
nis, G. Massey, and N. Rescher, editors. Philosophical Problems of the In- 
ternal and External Worlds, pages 103-128. University of Pittsburgh Press 
and Universitatsverlag Konstanz, Pittsburgh and Konstanz, 1993. 

[10] Allen I. Janis. Conventionality of simultaneity. In Stanford Encyclo- 
pedia of Philosophy. Winter 1998 Edition. The Metaphysics Research 
Lab Center for the Study of Language and Information. Ventura Hall 
Stanford University Stanford, CA 94305-4115, 1998. Internet address: 
http://plato.stanford.edu/entries/spacetime-convensimul/ . 

[11] Karl Svozil. Relativizing relativity. Foundations of Physics, 30(1):\00\- 
1016, 2000. e-print |arXiv:quant-ph/0001064 . 



24 



[12] R. J. Boskovich. De spacio et tempore, ut a nobis cognoscuntur. Vienna, 
1755. English translation in [B^. 



[13] George Francis FitzGerald. In J. Larmor, editor, The Scientific Writings of 
the Late George Francis FitzGerald. Dublin, 1902. 

[14] John S. Bell. How to teach special relativity. Progress in Scientific Culture, 
1(2), 1976. Reprinted in [^ pp. 67-80]. 

[15] John S. Bell. George Francis FitzGerald. Physics World, 5(9):31-35, 
September 1992. Abridged version by Denis Weaire. 

[16] A. D. Alexandrov. On Lorentz transformations. Uspehi Mat. NauL, 
5(3):187, 1950. 

[17] A. D. Alexandrov. A contribution to chronogeometry. Canadian Journal of 
Math., 19:1119-1128, 1967. 

[18] A. D. Alexandrov. Mappings of spaces with families of cones and space-time 
transformations. Annali die Matematica Pura ed Applicata, 103:229-257, 
1967. 

[19] A. D. Alexandrov. On the principles of relativity theory. In Classics of Soviet 
Mathemotics. Volume 4. A. D. Alexandrov. Selected Works, pages 289-318. 
1996. 

[20] H. J. Borchers and G. C. Hegerfeldt. The structure of space-time transfor- 
mations. Communications in Mathematical Physics, 28:259-266, 1972. 

[21] Walter Benz. Geometrische Transformationen. BI Wissenschaftsverlag, 
Mannheim, 1992. 

[22] June A. Lester. Distance preserving transformations. In Francis Buekenhout, 
editor. Handbook of Incidence Geometry. Elsevier, Amsterdam, 1995. 

[23] Karl Svozil. Connections between deviations from lorentz transformation 
and relativistic energy-momentum relation. Europhysics Letters, 2:83-85, 
1986. excerpts from [03]. 



25 



[24] Karl Svozil. Operational perception of space-time coordinates in a quantum 
medium. IlNuovo Cimento, 96B: 127-139, 1986. 

[25] Karl Svozil. Quantum interfaces. In Sciences of the Interface, pages 
76-88, Tub ingen, Germany, 2001. Genista Verlag. e-print a rXiv:quant- 
ph/0001064 available http://arxiv.org/abs/quant-ph/0001064. 

[26] Daniel B. Greenberger and A. YaSin. "Haunted" measurements in quantum 
theory. Foundation of Physics, 19(6):679-704, 1989. 

[27] Daniel M. Greenberger. Solar Eclipse Workshop, Vienna, August 1999. 

[28] Thomas J. Herzog, Paul G. Kwiat, Harald Weinfurter, and Anton Zeilinger. 
Complementarity and the quantum eraser. Physical Review Letters, 
75(17):3034-3037, 1995. 

[29] Niko Donat h and Karl Svozil. Finding a state in a haystack. a rXiv:quant- 
ph/0105046, 2001. 

[30] Erwin Schrodinger. Die gegenwartige Situation in der Quantenmechanik. 
Naturwissenschaften, 23:807-812, 823-828, 844-849, 1935. English 
translation in [^ and [^ pp. 152-167]; a copy can be found at 
www. emr.hibu . no/lars/eng/cat/. 

[31] Josef Gruska. Quantum Computing. McGraw-Hill, London, 1999. 

[32] M. A. Nielsen and I. L. Chuang. Quantum Computation and Quantum In- 
formation. Cambridge University Press, Cambridge, 2000. 

[33] K. W. Gruenberg and A. J. Weir. Linear Geometry (2nd Edition). Springer 
Verlag, New,York,Heidelberg,Berlin, 1977. 

[34] Hans Havlicek. private communication. 

[35] Rolf Landauer. Advertisement for a paper I like. In John L. Casti and J. F. 
Traub, editors. On Limits, page 39. Santa Fe Institute Report 94-10-056, 
Santa Fe, NM, 1994. 



26 



[36] R. J. Boskovich. De spacio et tempore, ut a nobis cognoscuntur. In J. M. 
Child, editor, A Theory of Natural Philosophy, pages 203-205. Open Court 
(1922) and MIT Press, Cambridge, MA, 1966. 

[37] John S. Bell. Speakable and Unspeakable in Quantum Mechanics. Cam- 
bridge University Press, Cambridge, 1987. 

[38] Karl Svozil. On the setting of scales for space and time in arbitrary quantized 
media. Lawrence Berkeley Laboratory preprint, LBL-16097, May 1983. 

[39] J. D. Trimmer. The present situation in quantum mechanics: a translation 
of Schrodinger's "cat paradox". Proc. Am. Phil. Soc, 124:323-338, 1980. 
Reprinted in [BDl pp. 152-167]. 



[40] John Archibald Wheeler and Wojciech Hubert Zurek. Quantum Theory and 
Measurement. Princeton University Press, Princeton, 1983. 



27 



